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SUPPLEMENTAL REPLY TO NON-FINAL ACTION 



The Applicant hereby supplements the reply to the Official Action filed May 5, 
2006 with the following additional remarks. 

The following remarks follow the first paragraph on page 15 of the reply. 

As a further explanation of the advantages provided by the Applicant's claimed 
invention, the Examiner is referred to the article entitled "The stability of multichannel 
sound systems with frequency shifting," J. Acoustical Society of America, vol. 116, 
No. 2, pp 853 - 871, 2004. A copy is enclosed. 

The enclosed document describes Monte Carlo simulations of the stability of N- 
channel systems with unitary and nonunitary reverberators (same results for early 
reflections) without frequency shifting (Figs. 16 and 17 and Tables I and II). The loop 
gain per channel for a 50% risk of instability for N=8 channels is -1 5.2 dB for a 
nonunitary system (with behaviour like that of a room), and is -13.3 dB for a unitary 
system. In other words, the Applicant's claimed system provides 2dB more stability 
margin. 2dB more loop gain is significant in a sound system. 

For a lower number of channels the difference is greater. For N=2, the loop 
gain for 50% risk is -10.2 dB for the unitary case and -13.6 dB for the non-unitary case. 
That is, the nonunitary system must operate with 3.4 dB less loop gain to remain stable. 

These simulations clearly indicate the advantages of the Applicant's claimed 

system. 
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CONCLUSION 

The Examiner is respectfully requested to consider the foregoing additional 
remarks and the enclosed document when reconsidering the rejection of the 
claims presented in this application. 
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The stability of multichannel sound systems with frequency 
shifting 

Mark A. Poletti 

Industrial Research Ltd., P.O. Box 31-310, Lower Hurt, New Zealand 

(Received 21 September 2003; revised 4 April 2004; accepted 2 May 2004) 

Time-varying components are used in some multichannel sound systems designed for the 
enhancement of room acoustics. Time-variation can usefully reduce the risk of producing ringing 
tones and improve stability margins, provided that any modulation artefacts are inaudible. 
Frequency-shifting is one form of time-variation which provides the best case improvement in loop 
gain, and for which the single channel stability limit has been derived. This paper determines the 
stability limit for multiple channel systems with frequency-shifting by generalizing the previous 
single-channel analysis. It is shown that the improvement in stability due to frequency-shifting 
reduces with the number of channels. Simulations are presented to verify the theory. The stability 
limits are also compared with those of time-invariant systems, and preliminary subjective 
assessments are carried out to indicate useable loop gains with frequency-shifting. © 2004 
Acoustical Society of America. [DOI: 10.1121/1.1763972] 

PACS numbers: 43.38.Tj, 43.55.Jz [MK] Pages: 853-871 



I. INTRODUCTION 

Sound systems are limited in the maximum amplifica- 
tion they can provide by regenerative feedback of sound 
from the loudspeakers to the microphones, which causes in- 
stability at high loop gains. For single channel systems with 
flat transfer functions in their electronic processors, the mean 
loop gain fx must be kept below — 1 2 dB to avoid ringing 
tones or instability. 1,2 For systems with TV broadband inde- 
pendent channels the total mean loop power gain jjl 2 N rises 
with the number of channels, allowing a greater enhance- 
ment of early energy and reverberation time (RT). 3 

A common application of multichannel sound systems is 
in room acoustic enhancement systems. 1 ' 4-7 For systems 
without electronic reverberation devices, the room reverbera- 
tion time gain is approximately equal to the steady state 
sound intensity gain r, 1 



Multichannel acoustic enhancement systems aim to provide 
sound quality which is indistinguishable from a natural room 
with enhanced acoustic properties. Therefore, they must pro- 
vide a more subtle sound enhancement than that produced by 
typical sound systems, and with a lower risk of unnatural 
artefacts. For example, the tolerance of ringing tones from a 
sound system is lower in a classical music concert than it 
would be at a rock concert. For this reason, eliminating ring- 
ing tones, and other more subtle colouration effects that 
might occur at lower loop gains, is of primary importance. 
The use of multiple channels allows increased power gains 
with a reduced risk of colouration artefacts. Nevertheless, 
unnatural artefacts can still occur at high loop gains. 

Aside from multiple channels, there are three other 
methods available for achieving increased loop gains and 
reducing unnatural artefacts. The first is the use of 
equalization. 8-13 Broadband equalization allows the loop 



gain to be maximized across all frequencies. Narrowband 
equalization allows individual ringing tones to be eliminated, 
but has the risk that the ringing frequency can change with 
small changes in microphone positions or room characteris- 
tics. A more stable approach is the use of adaptive filters 
which lock onto and null ringing tones. 14-16 These systems 
must distinguish between musical tones and ringing tones, 
but several commercial devices are available which suggest 
that this is feasible. A more comprehensive adaptive strategy 
is to have a filter adapt to the inverse of the loop transfer 
function, which renders the loop transfer function a simple 
delay. Alternatively, echo cancellation methods can in prin- 
ciple allow the cancellation of the loudspeaker to micro- 
phone signal, eliminating feedback. 17 These adaptive filter 
strategies rely on the robustness of an adaptive algorithm to 
allow the sound system to operate well above the naturally 
occurring stability limit. 

For multichannel systems, both inverse and echo- 
cancellation methods require the measurement of the loop 
transfer function matrix which is problematic due to the large 
amount of memory and processing required. In addition the 
inverse equalization method requires calculation of the in- 
verse of the matrix, which imposes a further large processing 
burden. 

The second method for increasing loop gains is to em- 
ploy artificial reverberation devices to provide greater rever- 
beration gain for a given loop gain. 7, 18-21 Most of these ap- 
plications are in in-line systems, where the microphones are 
close to the sound sources. In Ref. 7 it is shown that a rever- 
berator in a non-in-line system is the electroacoustic equiva- 
lent of a coupled room and that high reverberation gains are 
possible at moderate loop gains, provided that the loop gain 
is high enough to prevent double-sloping effects. It has also 
been shown that sound system stability may be improved by 
using unitary reverberators which produce a constant power 
gain with frequency, further reducing colouration effects. 22 

The third approach to controlling colouration is the use 
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of time-varying systems, which slowly change their transfer 
functions with time to prevent the build-up of ringing 
tones. 23-29 These systems are not as sensitive to changes in 
the room acoustics as narrowband equalization and are inher- 
ently more robust, and simpler, than adaptive systems. 

There are several methods of producing time variation, 
such as delay modulation, phase- and frequency-modulation 
and frequency-shifting. Delay modulation provides a fre- 
quency shift which rises linearly with frequency. Therefore, 
it produces a reduced risk of coloration at high frequencies, 
but may not improve performance at tow frequencies. 30 In 
some cases time-varying delays are incorporated into a digi- 
tal reverberation device which produces a more complicated 
phase and amplitude variation with frequency. 

Phase-modulation alters the signal phase at each fre- 
quency without affecting the amplitude. It can be carried out 
using time-varying allpass filters, but these produce a phase 
deviation that varies with frequency. A more precise method 
of phase modulation is achieved by taking the analytic signal 
and modulating it with a complex modulation of the form 
exp[/0(O]- Tnis provides the same phase variation at all fre- 
quencies and a more consistent control of feedback across 
the signal bandwidth. If the complex phase function is ana- 
lytic, then all components of the signal are shifted by posi- 
tive frequency shifts. 31 

Frequency-shifting (FS) is a special case of analytic 
phase-modulation using a linear phase sinusoid. The input 
signal is made analytic by eliminating the negative frequen- 
cies and the frequency shifted output is the real part of the 
modulated signal. This single sideband technique shifts the 
positive frequencies in the signal by w 0 and the negative 
frequencies by — a> 0 , 32 

The stability of sound systems with frequency-shifting 
has been investigated by Schroeder. 23 He developed methods 
for determining the maximum loop gain possible in a single 
channel system with and without frequency-shifting. Without 
frequency-shifting, his loop gain for a 50% risk of instability 
was - 8 dB for a reverberation time, bandwidth product of 
1 0 000. With frequency-shifting the loop gain could theoreti- 
cally be increased to 2.5 dB, an increase of 10.5 dB. How- 
ever, at this value the sound quality was found to be strongly 
affected by modulation artefacts. In practice, Schroeder de- 
termined that a 2 dB stability margin was required without 
FS, and a 6 dB margin was required with FS in order to 
avoid audible beating effects. The net benefit due to FS was 
then about 6 dB. 

Nielsen and Svensson have recently considered the per- 
formance of periodic phase, frequency and delay modulation 
in single channel systems. 29 They review the theory of 
frequency-shifting and show that it is the most efficient 
method for controlling regenerative feedback since it com- 
pletely smooths the loop gain and provides infinite carrier 
suppression. 

While frequency-shifting provides the maximum pos- 
sible loop gain increase, it may not produce subjectively ac- 
ceptable performance for music applications. There are two 
reasons. First, the frequency shift is a significant fraction of a 
semitone at low frequencies. For example a 5 Hz shift is 
about a semitone at 82 Hz (low E on a guitar). Delay modu- 
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lation avoids this issue but has poor stability control at low 
frequencies. 29 Secondly, the frequency shift is strictly posi- 
tive each time the signal goes around the feedback loop (for 
positive modulation frequencies). The subjective effect at 
high loop gains is that the signal frequency rises with time 
which is subjectively unacceptable. However, other time- 
variation components which produce both positive and nega- 
tive frequency shifts exhibit other problems. 29 Systems 
which have nonzero carrier suppression (a nonshifted com- 
ponent) provide a reduced stability improvement, and those 
which produce both positive and negative sidebands produce 
a net shift of zero after two or more times through the feed- 
back loop, which also limits stability. 

Nielsen and Svensson suggest that time- variation may 
be more successful in room enhancement systems where the 
sound enhancement is more subtle than that provided by 
sound reinforcement systems. 29 For example, frequency- 
shifting may offer useful control of ringing and improve the 
loop gain provided that the shift is small enough to avoid 
low- frequency problems and the loop gain increase is not too 
great. 

The analysis of time-varying systems has to date con- 
centrated on single channel systems, although some experi- 
ments on a four-channel time-varying system are reported in 
Ref. 30. However, no analysis of the theoretical performance 
gains that can be achieved using time-variation in multichan- 
nel systems has been carried out. 

In this paper, an analysis of frequency-shifting in multi- 
channel systems is undertaken. Frequency-shifting is the 
simplest form of time-variation to analyze and, as discussed 
above, it provides the best-case increase in loop gain. The 
analysis of the multichannel case follows that in Schroeder's 
paper and Ref. 29. The multichannel extension of this work 
is based on a norm approximation for the total power gain of 
the loop transfer function matrix. 

Two forms of simulations are carried out to verify the 
theory. Monte Carlo stability simulations are used first to 
verify the theoretical loop gain limits, and then a time- 
varying system simulator based on digital reverberators is 
used to allow a more detailed investigation of the behavior of 
sound systems with frequency-shifting. 

Finally, the frequency-shifting stability limits are com- 
pared with the stability limits for time-invariant systems and 
subjective tests are carried out in order to estimate the prac- 
tically useable loop gain for both cases. The practical in- 
crease in loop gain brought about by frequency-shifting is 
then estimated. 

II. THEORETICAL STABILITY LIMITS 
A. Review of power gains 

The analysis of multichannel frequency-shifting is based 
on the power gain of the loop transfer function matrix. The 
power gain is therefore briefly reviewed here. 

Consider a room containing TV loudspeakers and micro- 
phones. Assume that the distances between loudspeakers and 
microphones are large enough so that the reverberant sound 
transmission dominates the direct sound. Each transfer func- 
tion H nm (u}) then has real and imaginary parts which are 
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normally distributed (with transducer positions) with zero 
mean and equal variances <j) — cr\P The magnitude 
|A/ wm (aj)| is Rayleigh distributed. 

If a signal with power spectral density (psd) S uu (oj) is 
input to H n „,(co), the output psd is 33 

S vu (a>) = \h„ m (<0)\ 2 S ull (<o). (2) 

The power gain at each frequency is then |//„ m (oj)| 2 . 

The mean power gain may be found as the expected 
value of | /-/,„„( (o) | 2 over the ensemble of all possible micro- 
phone and loudspeaker positions. \lf nm (to)\ 2 = H; }mR (io) 
-hH 2 nmf ((o) is ^-squared distributed with two degrees of free- 
dom and so 34 



E{\H„ m (a>)\ 2 }=2cr 2 R . 



(3) 



Assume further without loss of generality that the mean 
power gain of each transfer function is unity (0-^= 1/2), and 
that the output of each microphone preamplifier is attenuated 
by a factor fx to control stability. 

Consider now the matrix of all transfer functions, H. 
Assume that N uncorrected signals with the same power 
spectrum S ul ,(<*>) are applied to the N inputs of H. (This is 
realistic if the microphones receive predominantly reverber- 
ant energy and have the same sensitivities and preamplifier 
gains.) The power at each output of H is then the sum of the 
powers arriving from each input 

N 

(4) 



m = I 



and the total output power is 

N A' 

2 2 \H nm \ 2 S, ltt (<o) = \\H\\ 2 S,M, (5) 

ni = I n — 1 

where || || denotes the Frobenius norm of H. The total input 
power is NS (lu (iii). Therefore the total power gain is 



(6) 



The expected value of the power gain is, with the assump- 
tions above 



E{r H }=-N 2 = N. 
This is scaled by fx 2 to control stability. 



(7) 



B. Frequency-shifting and analytic signal analysis 

In frequency-shifting a real signal s(t) is made complex 
by deriving an imaginary part £(/) which is the Hilbert trans- 
form of 5(/), 33 

s a (t) = s(t)+jHt). (8) 

This analytic signal has no negative frequency components. 
Frequency-shifting is carried out by multiplying the analytic 
signal by exp(/o> 0 /) and taking the real part of the result 

Re{5 0 (/)^ a, o'}- 5 (/)cos(w 0 0-^(Osin(a> 0 /), (9) 

which reintroduces negative frequency components such that 
the spectrum is Hermitian. 33 



U(<o) 




V(Q) 



FIG. 1. Multichannel sound system with frequency-shifting, assuming an 
analytic input signal. 

The analytic signal is often used in the analysis of time- 
invariant physical systems, because any real, physical signal 
is simply the real part of the associated analytic signal. The 
same approach is followed here. However, in the case of 
time-varying systems, the analytic signal is modulated and 
this modulation may produce negative frequency compo- 
nents. In frequency-shifting, this occurs for large, negative 
frequency shifts. In the application of interest here the fre- 
quency shift is small, and the room transfer functions have 
magnitudes that reduce to zero at zero Hz (since the loud- 
speaker responses are zero at dc). Therefore the modulated 
analytic signal will also remain analytic for negative fre- 
quency shifts. Hence, the analysis of stability with 
frequency-shifting may be carried out using analytic signals, 
and the spectrum of the output signal for the associated real 
signal is simply the Hermitian part of the analytic signal 
output. 

C. Multichannel systems with frequency-shifting 

Consider the regenerative system shown in Fig. 1. The 
transfer function matrix from the N loudspeaker inputs to the 
N microphone outputs is H(co). For simplicity, we have omit- 
ted the use of separate input and output transfer functions as 
in Ref. 2, since this has no effect on the stability. The input 
signals are applied directly to the TV system loudspeakers. 
The stability of the system will be examined by considering 
the vector of spectra at the input to the room, K. In the time 
invariant case, this is given by 

r=[I- y u,XH]- I t/-H- i [I-/xHX]- I Ht/. (10) 

The stability of the time-invariant system is governed by the 
scaled eigenvalues of XH, which are the same as those of 
HX. 35 The output vector is V=HY which has the same sta- 
bility criteria as Y. 

For the time-variant case, we assume for simplicity of 
analysis that all microphones signals are frequency shifted 
by the same modulation function exp[/a> 0 f]. As discussed 
above, we assume an input signal U(a>) which is analytic. 
The vector of spectra Y(<o) at the inputs to the room with 
frequency-shifting may then be written 

Y(co) = U(to) + /uX(a)- co Q )l\(co~ co 0 )Y(w- w 0 ) (11) 
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Substituting for X(ct>-a> 0 ) from the same equation 

Y(co) = U(co) + /ulX(o>- w 0 )H(o> - w 0 )[ U(co~ <o q ) 

+ ^X(w-2w 0 )H(w-2w 0 ) Y(oj-2oj 0 )] 

— U((o)~h /xX(cj — co 0 )H{co- co 0 )U((o— o) 0 ) 

+ /a 2 X(u>- u) 0 )H((o~ oj 0 )X(oj — 2o) 0 ) 

XU(co-2co 0 )Y(w-2w Q ). (12) 

This procedure may be continued, and the vector of 
spectra is given by the infinite series 

Y(o))= U(oj)± fxX{<o — a) 0 )H(co— oj 0 )U((o — to 0 ) 

+ /x 2 X(oj- cl> 0 )H(o>— co 0 )X(ct> — 2a> 0 )H(o>— 2w 0 ) 



X <y(w-2a> 0 )--- + Ai A 



II X(<o-ma> Q ) 

m= I 



X H( o) — mco 0 ) 



L/(w-Mw 0 ) + ' 



(13) 



If the system is to be stable the output power must be finite 
for a finite input power. The total power in all channels that 
contributes to / after M iterations through the feedback loop 
is the total input power multiplied by t!^e power gain 



r A ,= 



TV 



Y[ X(co — mco 0 )H(<o — mco 0 ) 



(14) 



D. Stability analysis for unitary systems 

We consider first the case where there is no second 
transfer function matrix X, and each output is connected di- 
rectly via a frequency-shifter and loop gain to the corre- 
sponding input. The power gain becomes 

M II 2 

n H(fti-iwwo) . 0 5 ) 

m= 1 



This expression is more problematic than the single 
channel case, because the squared norm of the product of 
matrices is not equal to the product of the squared norms 
unless TV = 1. It is known that for products of matrices, the 
following inequality holds: 35 

2 M 

<I1 ||H(a>-/n<w 0 )l| 2 , (16) 



E[ H(<o~mto 0 ) 

m = 1 



but this does not provide a tight enough bound to allow the 
stability to be accurately determined. However, for the case 
where the frequency shifted matrices are uncorrected, con- 
sisting of entries which are complex, zero-mean, and whose 
real and imaginary parts are normally distributed, it can be 
shown (Appendix A) that 



17 H((L>-m<£> 0 ) 

m= 1 



~ Ft ||H(co-wo> 0 ) 

' v m — 1 



(17) 



This approximation is good for large TV, but is less accurate 
for small TV, becoming least accurate for TV = 2. 



The transfer functions in H have identical real part and 
squared-magnitude co variances of 36 

1 

pOo)= - 



1 + 



13.8 



(18) 



where T is the reverberation time. The covariance is 0.2 for 
u> 0 = 27.6/r and the norm approximation should be reason- 
ably accurate (up to the accuracy governed by TV) for fre- 
quency shifts greater than this value. For a reverberation 
time of 1 s this requires a frequency shift of 4.4 Hz. 
Using the norm approximation, the power gain 



AT 



TV 



2M M 

n 

m = I 



I H( o>— m oj 0 ) 



(19) 



must reduce to zero as M tends to infinity. Writing the ap- 
proximation in dB 

1 0 log( r M ) = M^ dB — M\0 log( N) 

M 

+ 2 101og[||H(o;- W a; 0 )|| 2 ]^-oc (20) 

m= 1 

as M tends to infinity. For large M, the summation tends to 
the scaled mean of the norm in dB, 



lOlogtr^^MtMdB-lO^^+lOlogCH^w)!! 2 ]] 



the stability requirement then becomes 



/x dB < 10 log(TV) - 1 0 log[||H( a,)|| 2 ] . 
In Appendices B and C it is shown that 
10 



(21) 



(22) 



10log[||H(a/)|| 2 ] = 



ln(10) 
10 



ln(10) 



^(TV 2 ) 



2 T- c 



(23) 



where ^ (.) is the psi function, 37 and so the stability limit is 



10 



MdB<101og(7V)- Ii ^y^(TV 2 ). 



(24) 



This is the stability requirement for an TV channel system 
' with identical frequency-shifting in each channel, assuming 
that the norm approximation is accurate. For TV= 1 it yields 
yu dB <2.5 as in the original analysis. 23 The maximum loop 
gain with frequency-shifting reduces with TV. For example, it 
is — 1 2 dB for a 16 channel system. This is in keeping with 
the maximum loop gain per channel without frequency- 
shifting, which also decreases with IV. 

The stability limit also applies to multichannel systems 
with unitary processors in their feedback loops, since a uni- 
tary matrix does not alter the loop power gain. Specifically, if 
X is a unitary matrix, then 

||XH|| 2 =Tr{(XH) t (XH)}-Tr{HtxfXH} = ||H|| 2 , (25) 

where t denotes the conjugate transpose and Tr the trace 
function. 
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E. Stability analysis with finite M 

The analysis in the previous section requires that the 
sound regenerate through the feedback loop a large number 
of times. This requirement is satisfied in wideband systems 
with small frequency shifts. However for bandlimited sys- 
tems, and high-frequency input signals, the number of itera- 
tions M will be smaller than for low frequency signals, it is 
therefore instructive to consider the stability analysis for a 
finite M. The analysis will also be useful for verification of 
the stability equation using Monte Carlo simulations. 

Consider Eq. (20). The stability condition is obtained by 
assuming that the summation tends to the average of the 
norm in dB, scaled by M. An alternative viewpoint is that 
the summation is a random variable (with microphone and 
loudspeaker position) which tends to a normal variable as M 
increases. Provided that oj 0 is large enough each term in the 
sum is independent of the others, and has the form 
101og(v m )> where v m = ||//(£u-/77O) 0 )|| 2 . The squared norm 
y m is ^-squared distributed with n = 2 N 2 degrees of freedom. 
The mean of the squared norm in dB is given by Eq. (23) and 
the variance is (Appendix B) 



o- 2 = var{101og(>>)} = 



10 



ln(10) 



ln( 



-V 

10)/ 



k - 1 k 



(26) 



where K= 1.644924 and f(2JV 2 ) is the Riemann zeta func- 
tion (Appendix C). 

The sum of M terms, by the Central limit theorem, tends 
to a normal density with mean My and variance Mcr 2 . If y 
is scaled by a loop gain /x 2 the probability of instability is 
then 



Pr{inst,H}(MdB>A^) = Pr 



A/-1 

2 101og( M 2 y m ) 

m = 0 



> \0M \og(N) 



(27) 



[this is equivalent to saying that the power gain of the Mth 
term in Eq. (13) is greater than one]. Hence 



Pr{inst,H}( MdB ,/V\M) 



1 1 



erf 



M 1 01o g (A0-M dB -^ *(* 2 ) 
10 



ln(10) 



W 2 ) 



(28) 



where erf is the error function. The 50% risk of instability 
occurs when the argument of the erf function is zero, which 
yields the stability limit in Eq. (24). As M increases, the 
transition from 0 to 1 through the 50% value as jj. increases 
becomes increasingly rapid, and for large M the stability 
limit is sufficient to characterize the stability of the system. 



F. Stability analysis for systems including 
reverberators 

We now consider the case where the second transfer 
function matrix X represents a nonunitary reverberator. 
Many sound systems include reverberators to enhance the 
sound transmitted into the room. This increases the variance 
of the power gain of the feedback loop. In time-invariant 
systems this means that the loop gain must be reduced to 
maintain stability. With frequency-shifting, it has been 
shown that for single channel systems this increased variance 
allows a higher loop gain to be achieved than that without a 
reverberator present. 29 

We will assume that the reverberator has the same sta- 
tistical norm properties as the room matrix H but is scaled by 
]/yjN to have unit power gain, which allows comparison 
with the unitary case. Equivalently, X is statistically identical 
to H and the loop gain is scaled by \/yfN. 

The stability requirement is given by Eq. (22), but the 
matrix H is replaced by XH, and the division by \]N intro- 
duces an additional 101og(AO to the loop gain 



[ / x dB -101og(A r )]<101og(^)-101og[||X(cu)H(a;)|| 2 ]. 

(29) 

Treating H and X as independent, identically distributed ma- 
trix random variables, and using the norm approximation 



101og[||XH|| 2 ]~101og 



^l|X|| 2 ||H|| 2 



= 20 log[||H|| 2 ]-101og(A0 
the stability requirement is 



MdB + 20 log[||H( co)!! 2 ] < 30 log( AO . 



(30) 



(31) 



Substituting for 20 log[||H(<o)|| 2 ] from Appendix B yields the 
stability criterion 



^ dB <301og(A0- 



20 



ln(10) 



V(N 2 ). 



(32) 



The analysis for a finite number of iterations through the 
feedback loop may be carried out in a similar manner to the 
previous case. Applying the norm equality to XH it may be 
shown that the mean with normalized X is 



10 log 



1 



XH 



= -20 log(A0-f 



20 



ln(10) 



V(N 2 ). (33) 



The variance of the norm in dB is unaffected by the \/\jN 
scaling, and is twice that without a reverberator [see Eqs. 
(30) and (26)] 







1 


2^ 


var 


10 log 


-prXH 











= 2 



10 



(ln(10) 

The stability for M iterations is then 



£{2,N 2 ). (34) 
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FIG. 2. Monte Carlo Simulation results for sound sys- 
tem with unitary feedback for 7V= 1 — 32. 
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(35) 

The 50% risk loop gain is then given by the right-hand side 
of Eq. (32). 

III. STABILITY SIMULATIONS 

A. Introduction 

The validity of the theoretical stability limits were inves- 
tigated using two forms of simulation. In the first, Monte 
Carlo simulations were carried out to assess Eqs. (28) and 
(35) for a finite value of M. The 50% risk values of these 
simulations should match the stability limits in Eqs. (24) and 
(32). In the second approach a digital reverberator was used 
to represent a room and feedback was applied from N out- 
puts to N inputs via frequency shifters. A second reverberator 
could be optionally included in the feedback loop to simulate 
the inclusion of a reverberator in a sound system. 

B. Monte Carlo simulations 

The transfer function matrix between a loudspeaker and 
microphone in a room has real and imaginary parts which are 
normally distributed, provided that the direct sound level is 
small compared to the reverberant sound level. The correla- 
tion functions of the real and imaginary parts are as given in 
Eq. (18). Therefore, complex random processes may be gen- 
erated using an autoregressive (AR) process which produce 
the same statistical and correlation properties as the transfer 
functions in rooms. 2 38 

If an N channel sound system is used, and the micro- 
phones and loudspeakers are further away from each other 



than the spatial correlation distance, 39 then independent au- 
toregressive processes with the same statistics and correla- 
tion properties may be used to model the transfer function 
matrix. 

Hence, to determine the stability of multichannel sound 
systems with frequency-shifting, an AR process is used to 
produce a set of transfer function matrices at equally spaced 
frequencies. The set of transfer function matrices are scaled 
by the loop gain. The values of the scaled transfer function 
matrices at M frequencies spaced at / 0 , where f 0 is the 
frequency shift, are multiplied together. The squared norm of 
the product is then found and the power gain calculated. If 
the power gain exceeds one, the system is assumed to be 
unstable. The simulation is then repeated with a new AR 
process. The probability of instability at the given loop gain 
may be determined from K such simulations, and the vari- 
ance calculated. 2 For systems including reverberators, a sec- 
ond AR process is used to simulate the reverberator. 

Monte Carlo simulations are shown in Fig. 2 for a room 
with a reverberation time of 1 second, for numbers of chan- 
nels in powers of 2 from 1 to 32 and for M=50. The number 
of trials was 400. The frequency shift was 10 Hz, for which 
the correlation between samples is 0.05. The predicted sta- 
bility risk is closely matched by theory for N= 1, where the 
norm approximation is exact. For N= 2 the theoretical 50% 
value is 0.5 dB too low. For N=4 and 8 the error is 0.4 and 
0.25 dB, respectively. For N= 16 and 32 the error is negli- 
gible. Hence, the norm approximation produces a worst-case 
error of half a dB for /V= 2 and the error reduces with TV. 

For smaller frequency shifts, the assumption of indepen- 
dent variables in the summation of Eq. (20) becomes less 
correct. The slope of the resulting stability curve reduces 
slightly, diverging from the theoretical curve predicted by 
Eq. (28) for small and large probabilities, and particularly for 
small /V where the slope is lower. However, the deviation is 
small for probabilities near 50% and the 50% stability limit 
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Loop gain (dB) 



remains close to the theoretical stability limit predicted by 
Eq. (28). 

The equivalent simulations for a sound system including 
a multichannel reverberator with natural statistics are shown 
in Fig. 3. The N= 1 results are again accurate. The errors for 
JV=2, 4, 8, and 16 are 1.3, 0.9, 0.5, and 0.25 dB, respec- 
tively. These are larger than the results without a reverbera- 
tor, because the norm approximation was applied twice to 
produce the probability of instability. 

C. Reverberator-based simulations 

In order to further investigate the accuracy of the stabil- 
ity limits derived above, a second simulator was developed. 
The transfer function matrix of a room was simulated as a 
time- in variant, 24-channel reverberator using 24 delay lines 
cross-coupled via an orthonormal matrix. 7 This provided a 
fully cross-coupled 24X24 matrix of transfer functions with 
Rayleigh magnitude statistics. Feedback was then applied 
around 7V<24 channels of the reverberator and frequency- 
shifting was implemented in the feedback loop. A second 
reverberator was implemented in the feedback loops as re- 
quired. This second reverberator could either simulate the 



Rayleigh statistics of rooms, or could be of a unitary design. 
The majority of simulations were earned out with no second- 
ary reverberator, or with a standard nonunitary reverberator. 
Some unitary reverberator simulations were later run to 
check that the stability limits werp the same as the no- 
reverberator case. 

A block diagram of the simulator is shown in Fig. 4. As 
discussed, this simulator uses an analytic input signal, and so 
the analytic filter A(a>) is placed at the input to eliminate the 
negative frequencies in any applied real signal. The signals 
in the reverberator are thus complex. The envelope of a 
given output is obtained from its low-pass filtered magni- 
tude, and the real part of two adjacent channels written to a 
stereo wave file for subjective assessments, discussed in Sec. 
IV. 

The continuous-time analytic filter A{o>) has the theo- 
retical complex impulse response 

In practice, a discrete, finite response, bandlimited form of 
the analytic filter is required. The finite response means that 
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FIG. 4. Reverberator-based frequency shifting simula- 
tor. X{ o>) is bypassed for simulations without a second- 
ary reverberator. 
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FIG. 5. Magnitude of the analytic filter response. 
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the analytic filter has a low frequency cutoff below which the 
magnitude of the response reduces. A 5 12 tap finite impulse 
response filter was designed by specifying a desired complex 
filter response with a gain of two for positive frequencies and 
a linear transition from a transition frequency of 100 Hz to 
an attenuation of — 40 dB at zero frequency. The filter taps 
were then obtained from a least squares fit to the desired 
response. The resulting magnitude response is shown in Fig. 
5. 

The ''naturalness" of the reverberators may be verified 
by showing that they produce ideal statistics in the frequency 
domain, and that they have a high echo density in time with 
an absence of flutter effects. The squared norm of the rever- 
berator transfer function matrices were close to the theoreti- 
cal probability densities for all N. For example, the mea- 
sured probability density of the squared norm of the transfer 



function matrix for 8 channels is shown in Fig. 6 together 
with the theoretical ^-squared density. The measured density 
is closely similar to the theoretical. 

The echo density was evaluated by listening to the im- 
pulse response. The response sounded smooth without any 
noticeable flutter effects. The envelope of the decay was ex- 
ponential and matched the required RT. No damping was 
applied to the reverberator, so that its reverberation time was 
constant with frequency. A reverberation time of 1 .0 s was 
used in both the main and the secondary reverberator. The 
theoretical stability limit is not affected by the RT, and the 
RT of the reverberators was controlled by altering their in- 
ternal delay times which did not affect their norm statistics. 
A frequency shift of 5 Hz was used, producing a correlation 
between adjacent samples of 0.16. The sample rate was 44.1 
kHz, and first-order 20 Hz high pass and 20 kHz low pass 




Magnitude 
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FIG. 6. Theoretical and measured probability density of 
the power gain of an 8 channel reverberator transfer 
function matrix. 
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FIG. 7. Set of output envelopes for a single channel 
system without reverberator and with loop gains from 1 
dB to 3 dB in 0.25 dB steps. 



filters were implemented in each channel to prevent the ap- 
pearance of negative frequency components. 

Simulations were run for N= 1, and for even channels 
from 2 to 16, and for a number of loop gains from below to 
above the relevant theoretical stability limits in Eqs. (24) and 
(32). A duration of 60 s was used to allow the envelope to be 
observed for a considerable time. For high numbers of chan- 
nels the decays were linear and 30 s were sufficient to estab- 
lish stability, in all cases the input was a unit delta function 
which, upon being filtered by the analytic filter, produces the 
analytic filter impulse response. 

For low 7 channel numbers the loop gain was incremented 
in 0.25 dB steps, but for some of the higher channel numbers 
more closely spaced increments of 0.1 and 0.05 dB were 
possible. For each simulation the squared magnitude of the 
analytic output signal was found. This was low-pass filtered 
and then decimated by a factor of 128 to reduce the data size. 
The set of power envelopes for the range of loop gains was 
then plotted. The stability limit was estimated as the loop 
gain which produced the envelope with slope closest to zero. 

A set of power envelopes obtained for N~ 1 are shown 
in Fig. 7, for loop gains from 1 to 3 dB in 0.25 dB steps. The 
envelope initially rises for all loop gains, but then reduces 
with time for loop gains below 2 dB, suggesting that 2 dB is 
the stability limit. This is close to the theoretical limit of 2.5 
dB. Nielsen and Svensson produce a maximum loop gain of 
0 dB from measurements. 29 However, they noted that large 
temporary level increases occurred which saturated their 
equipment, particularly for low frequency shifts. Figure 7 
verifies that these large increases occur even for the rela- 
tively large shift of 5 Hz (and larger level increases were 
found for simulations with lower frequency shifts). This sug- 
gests that, in practice, the 2.5 dB limit is not achievable 
because any sound system will have insufficient dynamic 
range to handle the signal amplitude at such loop gains. It is 
also highly likely that the sound quality will be unacceptable 
at loop gains approaching the stability limit in any case. 

A second simulation for N~ 1 is shown in Fig. 8, using 



lower loop gains ranging from — 1 to -f 1 dB, and looking at 
the first 2 s of the response. The output envelope slope is 
zero or negative for loop gains of 0 dB and lower, and so 
saturation of audio systems is unlikely. Hence 0 dB is a 
practical stability limit, and is consistent with the results 
measured by Nielsen and Svensson. 

The output envelopes for a 16 channel system with no 
secondary reverberator are shown in Fig. 9, for loop gains 
ranging from - 12.15 to - 1 1.85 dB in 0.05 dB steps. In this 
case, there is no large initial buildup of level. The envelopes 
are approximately linear with time, and the stability limit is 
in the vicinity of - 1 1 .95 dB, which is 0.08 dB from the 
theoretical value of — 12.03 dB. The simulations in this case 
produce a stability limit very close to the theoretical predic- 
tion. 

The complete set of simulation-derived and theoretical 
stability limits without a reverberator are shown in Fig. 10. 
Also included are Monte Carlo results for the 50% risk loop 
gain. The N= 1 simulator result is 0.5 dB lower than theory, 
whereas the Monte Carlo result is closer. Since both simula- 
tions include the correlations between samples of the transfer 
function matrix, finite correlations between adjacent products 
does not explain this discrepancy. It may be explained by the 
approximation of the sum in Eq. (20) by the exact mean. For 
N~2 both simulations are about 0.5 dB higher than theory 
and the error tends to reduce with N. Also included in Fig. 
10 is the 50% risk of instability limit without time variation 
found using the methods in Ref. 2. This allows the theoreti- 
cal improvement in loop gain produced by frequency- 
shifting to be determined. For N= 1 the theoretical increase 
is about 1 1 dB and as /V increases the improvement reduces 
to around 3 dB for the 16 channel case. Thus, it may be 
concluded that time-variation produces a reducing benefit 
with the number of channels. The reason for this is that for 
large N 9 the statistics of the loop power gain become increas- 
ingly ideal. From Eqs. (B3) and (B4), the mean of the 
squared norm of H is N 2 (for 2cr^ = 1) and the variance is 
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FIG. 8, Set of output envelopes for a single channel 
system without reverberator and with loop gains from 
- I dB to 1 dB in 0,25 dB steps. 
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Time (s) 



N 2 /4. The relative deviation of the squared norm may be 
quantified by the ratio of the standard deviation to the mean, 
which is 



1 

2N' 



(37) 



The deviation of the loop power gain thus reduces with IV, 
and there is a reducing variation for time variation to exploit. 

Simulations were also carried out which included a non- 
unitary reverberator with the same statistics as those of a 
room. The output envelopes showed a reduced slope varia- 
tion with loop gain compared to the no-reverberator case. An 
example for 16 is shown in Fig. 11 for loop gains vary- 
ing in 0.1 dB steps (twice the stepsize of that in Fig. 9), The 
envelope slopes have about half the variation with loop gain 
of those in Fig. 9. This is explained by Fig. 3, which shows 



a slower transition from stability to instability compared to 
Fig. 2. The stability limit is about — 11.8 dB. Because of the 
slower transition from instability to stability, time varying 
systems with nonunitary reverberators will require a larger 
stability margin than systems without reverberators. 

The complete set of stability limits for systems with 
nonunitary reverberators is shown in Fig. 12. For N= 1 the 
Monte Carlo result is closer to theory than the reverberator 
simulation, but the remaining results follow a similar trend, 
with around 1 dB error for N=2 and a reducing error with 
jV. However, the closeness of theory and measurement is not 
as good as the previous case, due to the additional norm 
approximation required in the derivation of the theoretical 
stability limit. 

The stability limits for the case of no time variation are 
also included in Fig. 12. (These values are discussed more 




FIG. 9. Set of output envelopes for a sixteen channel 
system without reverberator and with loop gains from 
- 12.15 dB to - 11.85 dB in 0.05 dB steps. 
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FIG. 10. Stability limits for multichannel sound sys- 
tems without reverberators, including Monte Carlo re- 
sults and theoretical time-invariant loop gain limits for 
50% risk of instability. 



fully in Sec. HID.) The results show that frequency-shifting 
allows the mean loop gain to be increased by 1 8 dB for the 
single channel case, which is 7 dB larger than the case with 
no reverberator. 

For N> 1 the maximum loop gain limits are approxi- 
mately the same for systems with and without reverberators, 
and the two limits become closer as N increases. The in- 
creased loop gain enhancement for systems with reverbera- 
tors at low channel numbers occurs because these systems 
have an inherently poorer performance without frequency- 
shifting, due to the greater variance of their loop transfer 
functions. Time-variance allows these systems to bring their 
loop gains up to the same levels as unitary systems, and to 
exceed them at low channel numbers. 

The calculation of the theoretical loop gain limits in Sec. 
II requires that the same frequency shift is used in all chan- 
nels. In practice, different frequency shifts could be used. To 



investigate this case a simulation was run for a 16 channel 
system with the same parameters as in Fig. 9, but with nor- 
mally distributed frequency shifts varying from 3.27 Hz to 
6.31 Hz, with a nominal m^an of 5 Hz. The power envelopes 
are shown in Fig. 13, which may be compared to the enve- 
lopes in Fig. 9. Each decay curve has' a slightly increased 
slope, and this is probably due to the fact that the frequency 
shift in some channels is lower than 5 Hz. However the 
stability limit , of about - 12.02 dB is only slightly higher 
than the — 1 1.95 dB limit in Fig. 9. Hence the stability limit 
does not appear to be significantly affected by using different 
frequency shifts per channel. However, if too many channels 
have low frequency shifts, it is likely that the stability per- 
formance will degrade more significantly. Furthermore, high 
frequency shifts will produce more noticeable modulation 
artefacts. 

In practical multichannel sound systems, variable rever- 




-40 



FIG. 1 1 . Set of output envelopes for a sixteen channel 
system with reverberator and with loop gains from 
- 12.1 dB to - 1 1.5 dB in 0.1 dB steps. 
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FIG. 12. Stability limits for multichannel sound sys- 
tems including nonunitary reverberators, including 
Monte Carlo results and theoretical time -invariant loop 
gain limits for 50% risk of instability. 



beration levels with distance mean that the loop gain in each 
channel may not be the same. 42 In the time-invariant case, 
this increases the variance of the characteristic functions, 2 
and leads to a reduction in stability margin. Singulations were 
therefore undertaken in which the loop gains varied uni- 
formly from — cr dB to cr dB dB about the chosen value. The 
loop gains were then normalized so that the linear mean 
equalled the chosen value. Figure 14 shows the output de- 
cays for a sixteen channel system with — 3 to 3 dB variation 
in loop gain for each case. The stability limit is close to 
- 12.05 dB, which is only 0.1 dB higher than the - 1 1.95 dB 
limit in Fig. 9. This shows that frequency-shifting allows the 
stability limit with moderate loop gain variations to approach 
that for the mean loop gain across channels. However, the 
stability is likely to be more significantly affected for larger 
loop gain variations. For example, if one loop gain is set to 
zero the system has N— 1 channels and Fig. 10 shows that in 



this case the mean loop gain limit will increase. 

Finally, simulations were also undertaken in which the 
secondary reverberator was unitary, for 7V= 16 and N=S 
channels. A set of output envelopes for N = 16 are shown in 
Fig. 15. The limit is closely similar to the decays in Fig. 9, 
with a stability limit of — 12 dB. This is to be expected since 
for /V= 16 both unitary and nonunitary reverberators produce 
the same stability limit. The variation in the decay slopes 
with loop gain is slightly reduced compared to the no- 
reverberator case but is not as low as the standard reverbera- 
tor case, however the variation in each unitary envelope is 
smaller than those of the no-reverberator case. This may be 
due to the fact that the unitary reverberator is diffusing the 
feedback signals from the primary reverberator, which re- 
duces the effect of dominant feedback paths that occur with 
no reverberator. The envelope fluctuations are also smaller 
than those of the nonunitary reverberator which is probably a 




FIG. 13. Set of output envelopes for a sixteen channel 
system without reverberator, with loop gains 
- 12.15 dB to - 11.85 dB in 0.05 dB steps and with 
different frequency shifts in each channel. 
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FIG. 14. Set of output envelopes for a sixteen channel 
system without reverberator, with loop gains 
- 12.15 dB to - 11.85 dB in 0.05 dB steps and with 
loop gain variation - 3 to + 3 dB across channels for 
each case. 



-50 



result of the reduced variance in the unitary case. 

The N=S channel results were also similar to the no- 
reverberator case with an estimated stability limit of between 
-r 8.8 and - 9 dB in both cases, close to the -9 dB theoret- 
ical limit. 



D. Stability of time-invariant multichannel systems 
with reverberators 

In order to assess the effectiveness of frequency- 
shifting, the stability limits must be compared to those ob- 
tained for multichannel time-invariant systems. 23 The stabil- 
ity limits for unitary systems have been derived in Ref. 2, 
and included in Fig. 10. However, the stability results for 
multichannel sound systems with a reverberator in the feed- 
back loop — which require knowledge of the statistics of the 
eigenvalues of the product of two complex normal 



matrices — are unknown. The risk of instability (and the 50% 
limits used in Fig. 12) are determined empirically here using 
Monte Carlo simulations. 

The loop transfer function matrix in systems including 
reverberators is the product of the room transfer function 
matrix and the reverberator matrix. Therefore, the stability of 
these systems may be estimated by generating a separate 
multichannel autoregressive process representing each ma- 
trix. The eigenvalues of the matrix product at each frequency 
are determined, and if the real part of any eigenvalue exceeds 
one, the system is assumed to be unstable. 

For the single channel case, the calculation of eigenval- 
ues is not required and a theoretical stability limit can be 
calculated. In Ref. 2 the stability of a single channel system 
with secondary reverberator was determined using a magni- 
tude analysis. In Appendix D the real part analysis is carried 




FIG. 15. Set of output envelopes for a sixteen channel 
system with a unitary reverberator and with loop gains 
from - 12.15 dB to - 11. 85 dB in 0.05 dB steps. 
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FIG. 16. Monte Carlo simulations of the probability of 
instability for time-invariant systems with nonunitary 
reverberator feedback: N=l(0) t N=2(X), N 
= 4(0), vV=8(*), and N= 16(0). 
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out, which is more accurate. The resulting theoretical limit of 
— 13.6 dB was closely matched by the Monte Carlo simula- 
tion result of — 13.5 dB. 

The probability of instability for channels in powers of 
two up to 16 is shown in Fig. 16, and — for comparison — the 
Monte Carlo results for the unitary case are shown in Fig. 17. 

Note first that the unitary 50% stability limits have a 6 
dB range from -9dB (A^=l) to -15.2dB (N~ 16), 
whereas the nonunitary results have a reduced range of 3 dB 
from -13.5dB to -16.6dB. The two results become in- 
creasingly similar for increasing N, but at low channel num- 
bers, unitary systems show an increasing 50% stability limit 
over the nonunitary case of up to 4.5 dB for N= 1. (This 4.5 
dB difference using the real part analysis is more accurate 
than the magnitude-based analysis in Ref. 2, which predicted 
a 5 dB difference between unitary and nonunitary cases.) The 



reduced performance of nonunitary systems is due to the 
larger variance in the eigenvalues of these systems, particu- 
larly at low channel numbers. 

Secondly, the nonunitary results have a reduced slope 
with loop gain, showing a higher risk of instability for a 
given number of dB below the 50% limit than the unitary 
case. In other words, nonunitary time-invariant systems are 
likely to require a larger stability margin than unitary sys- 
tems. This is again due to the increased variance of the ei- 
genvalues, and is similar to the FS results in Figs. 2 and 3. 

Thirdly, the stability limits for the non-unitary case are 
approximately the same for N= 1 and 7V= 2 channels. This is 
due to the fact that the real part of the eigenvalues of the 
N=2 system have similar statistics to the real part of the 
N— 1 case. 
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FIG. 17. Monte Carlo simulations of the probability of 
instability for time-invariant systems with unitary rever- 
berator feedback. 
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TABLE 1. Time -in variant, unitary feedback. 



N 


50% Limit 


Impulse 


Music 


LYL ] I1JIUUJ H 


LVI Cll J^J 1 I 


1 


-9.0 


-15(2.5) 


-14(5) 


-15 


6 


2 


- 10.2 


-15(2) 


-15(3) 


- 15 


5 


4 


-11.6 


-17(1) 


-17(4) 


- 17 


6 


8 


- 13.3 


-19(2.5) 


-20(3) 


-20 


7 


16 


- 15.2 


-21 (1.5) 


-21 (3) 


-21 


6 



IV. PRELIMINARY ASSESSMENT OF SUBJECTIVE 
LOOP GAIN LIMITS 

Sound systems must always be operated at some margin 
below the absolute stability limit to sound acceptable. The 
theoretical limits derived above do not provide sufficient in- 
formation to determine these loop gain limits. Subjective 
tests were therefore carried out to estimate them. Both im- 
pulse response and music sound sources were used, to allow 
the assessment of both running music and transient decays. 
For the frequency-shifting simulations a 5 Hz shift was used. 
The same simulator was used for the time-invariant case with 
a frequency shift of 0 Hz. 

The reverberation times in both reverberators were kept 
constant with frequency. This simplification allowed the sub- 
jective results to be compared directly with the stability lim- 
its, which also assume constant reverberation time with fre- 
quency. Three researchers made assessments of the 
acceptable loop gains with impulse responses, and four re- 
searchers assessed the music simulations. The results were 
averaged and rounded to the nearest decibel. The number of 
subjects is relatively small, and the results are therefore in- 
dicative only. A more rigorous subjective analysis would be 
required to produce more accurate results, but such tests are 
beyond the scope of this paper. 

Stereo impulse responses were generated from adjacent 
outputs from the reverberator for N=\ to 16 channels in 
powers of 2. For each N, five sets of simulations were car- 
ried out for the time- invariant case, with different randomly 
generated matrices in the reverberators to produce a range of 
ringing conditions. One of these impulse responses was used 
to filter the music excerpt to produce a set of music samples. 
For the time-varying case three simulations were done, with 
the assumption that the results would be more homogeneous 
due to the averaging of the transfer functions. For each simu- 
lation, the impulse response was generated for a range of 
loop gains from near the stability limit to several dB below. 

For the music simulations, a 10 s excerpt from Handel's 
water music was used as the source. 40 This includes a bass 
note of close to 82 Hz (E2) which allows the low frequency 
performance to be heard. 



TABLE II. Time -invariant, nonunitary feedback. 



N 


50% Limit 


Impulse 


Music 


Minimum 


Margin 


1 


- 13.5 


-21(2) 


-18(6) 


-21 


8 


2 


- 13.6 


-21(2) 


"20(7) 


-21 


8 


4 


- 14.1 


-21 (2) 


-20(5) 


-21 


7 


8 


- 15.2 


-23(1) 


-22(4) 


-23 


8 


16 


- 16.6 


-23(2) 


-23(3) 


-23 


6 
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TABLE III. Frequency-shifting, unitary feedback. 



N 


50% Limit 


Impulse 


Music 


Minimum 


Margin 


I 


2.5 


-8(2) 


-12(8) 


-12 


15 


2 


-2.4 


-10(1) 


-14(4) 


- 14 


12 


4 


- 5.9 


-11 (1) 


-15(1) 


- 15 


9 


8 


-9.0 


- 13 (1) 


-18(2) 


-18 


9 


16 


- 12.0 


- 16(2) 


-21(1) 


-21 


9 



The time-invariant stereo impulse responses had cross- 
correlation coefficients less than 0.5 and so were reasonably 
binaurally dis-similar. The stereo wave files were evaluated 
using headphones. 

A. Results 

The subjective loop gain limits determined from the im- 
pulse responses and music are shown in Tables 1-IV and the 
estimated increases in loop gain due to frequency-shifting 
are shown in Table V. Also given in Tables 1-IV are the 
range of the subjective loop gain estimates in parentheses. 
The ranges are typically greater for the music samples, show- 
ing that artefacts are more difficult to discern with a continu- 
ous and complex music signal. 

The subjective loop gain limit for the time- invariant uni- 
tary N=\ case with music is — 14 dB, whereas the limit 
with impulses was - 15 dB. These results are, respectively, 2 
and 3 dB lower than the -12 dB often quoted for single 
channel systems. 1 One reason for this is that these results are 
from simulations with flat loop gains and reverberation times 
with frequency, whereas the results in Ref. 1 are based on 
listening tests with actual sound systems, where the loop 
gain tends to reduce with frequency due to loudspeaker 
power responses and air absorption. In a simulation with 
damped reverberation times in both reverberators, but with 
the same low frequency loop gain, the ringing was subjec- 
tively reduced. Hence loop gains closer to — 12 dB are more 
likely to be achieved in practical installations. 

For the time- invariant unitary case the minimum subjec- 
tive loop gain limit for Af = 1 is - 1 5 dB and with frequency- 
shifting it is — 12 dB. This 3 dB improvement is smaller than 
Schroeder's value of 6 dB. 23 However, the loop gain increase 
for the impulse case is 7 dB which is closer to Schroeder's 
value. This suggests that music signals offer a more stringent 
test for the quality of sound with frequency-shifting and that 
smaller loop gain improvements are possible with music than 
for signals with fewer continuous tones or less low frequency 
content, such as impulsive or speech signals. 

For increasing numbers of channels, the available in- 
crease in loop gain tends to be reduced for both unitary and 
nonunitary cases, aside from the anomalous reduction to 1 



TABLE IV. Frequency-shifting, nonunitary feedback. 



N 


50% Limit 


Impulse 


Music 


Minimum 


Margin 


1 


5 


-14(4) 


-13(8) 


-14 


19 


2 


-1.9 


-15(3) 


-14(5) 


-15 


13 


4 


-5.8 


-15(2) 


-15(2) 


-15 


9 


8 


-8.9 


-17(1) 


-18(1) 


-18 


9 


16 


- 12.0 


-18(2) 


-21 (1) 


-21 


9 
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TABLE V. Loop gain improvement due to frequency-shifting. 



V. CONCLUSIONS 



N Unitary loop gain increase Nonunitary loop gain increase 



1 3 7 

2 1 6 
4 2 6 
8 2 5 

16 0 2 



dB for the jV= 2 unitary case, which is attributed to the range 
of subjective estimates for the music samples and the rela- 
tively small number of subjects used. 



B. Discussion 

The subjective limits determined here show that — while 
frequency-shifting provides significant increases in theoreti- 
cal loop gain — the gain increases that can be achieved with- 
out introducing artefacts are smaller. This is consistent with 
the results in Ref. 23. Furthermore, in keeping with the the- 
oretical stability limits, the subjective potential gain due to 
frequency-shifting reduces with the number of channels. For 
example, for unitary systems with 16 channels,' the time- 
invariant loop gain and frequency-shifting limits are both 
about -21 dB and no benefit is produced by FS. For the 
nonunitary 16 channel case the benefit is only 2 dB. This 
lack of loop gain improvement is due to the pitch-shifting of 
low frequencies that occurs in music which limits the loop 
gain to well below the theoretical value. If the frequency 
shift was reduced at low frequencies, the loop gain could be 
increased, producing some possible benefit. 

Low frequency pitch-shifting artefacts may be reduced 
by using other time-variation methods such as delay- 
modulation, which produces lower pitch shifts at low fre- 
quencies. However, this also means that the low frequencies 
are not shifted sufficiently to produce the stability limits that 
frequency-shifting produces. In essence, at low frequencies, 
the requirement for low pitch -shifting means that the system 
becomes quasi-time-invariant, leading to low frequency in- 
stability, as has been reported by Nielsen and Svensson. 29 
Such systems could be viewed as a low frequency time- 
invariant part and a high frequency time-varying part. The 
stability limits then become a complicated function of the 
two contributing parts. 

Other time-variation systems may introduce both ampli- 
tude and phase modulation. Amplitude modulation has the 
effect of modulating the loop gain at each frequency, 
whereas phase modulation shifts each frequency by (in gen- 
eral) a varying amount with time. The subjective effect of 
such time variation will undoubtedly differ from that pro- 
duced by frequency-shifting. 

Therefore, while the theoretical maximum loop gains 
derived here provide an upper bound for any time-varying 
system, the subjective stability limits determined apply spe- 
cifically to frequency-shifting, and it is expected that differ- 
ent subjective limits would be obtained for other forms of 
time-variation. 
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This paper has determined the theoretical stability limits 
of multichannel sound systems which employ frequency- 
shifting and compared the results with the time-invariant 
case. Frequency-shifting produces the best-case stability for 
time-varying systems since it produces infinite carrier sup- 
pression and one-sided sidebands which avoids shifting fre- 
quencies back to the carrier. It therefore provides an upper 
limit to the loop gain increase that may be expected from the 
application of time-variation methods in sound systems. 

The derived stability limits show that the improvement 
in loop gain produced by frequency-shifting reduces with the 
number of channels, TV, since the power gain statistics be- 
come increasingly ideal with N. Multichannel sound systems 
with additional nonunitary reverberation systems have an in- 
herently lower stability limit without time-variation, and 
therefore time-variation provides a greater improvement in 
loop gain for these systems, particularly at low channel num- 
bers. 

Preliminary estimates of the subjective loop gain limits 
have also been made, based on both impulse responses and a 
music sample. The results show a reducing benefit of 
frequency-shifting with channel numbers, and are consistent 
with the theoretical limits. 

Simulations have been included for the case of different 
frequency shifts and loop gains in each channel. These show 
that the theoretical stability limit is robust to such variations, 
although wide variations of frequency shift or loop gain can 
be expected to produce a degradation in performance. The 
subjective assessment of these variations has not been inves- 
tigated. 

The theoretical stability limits have been derived assum- 
ing ideal room transfer functions with zero-mean complex 
normal statistics (Rayleigh magnimdes), for the case of both 
unitary and nonunitary (Rayleigh) feedback. The analysis 
could be extended to include the case where one of the room 
transfer function matrices has nonzero means (Ricean mag- 
nitudes), producing a squared matrix norm which is noncen- 
tral ^-squared distributed. 41 This case has not been consid- 
ered here, but the normalized Ricean case will produce a 
stability limit between the unitary and Rayleigh cases, 2 and 
at high channel numbers, the two stability limits become 
identical. 

Practical sound systems have frequency-dependent loop 
gains due to factors such as air absorption and loudspeaker 
power responses. Since frequency-shifting produces a stabil- 
ity limit governed by the mean loop gain, it is possible that 
time-variation will produce greater benefits in practical sys- 
tems due to the increased loop gain variance. However, the 
peak loop gains will also be higher, which may produce in- 
creased subjective artefacts at frequencies near the peaks. 
Furthermore, if the loop gain has wide variations with fre- 
quency, equalization would be more appropriate than the use 
of time-variance. 

Frequency-shifting — while it produces the maximum 
possible loop gain increase — may not be subjectively opti- 
mum, since it produces significant pitch-shifting artefacts at 
low frequencies which limit the loop gain increase. Other 
time-variance techniques, while producing lower theoretical 
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stability limits, may be able to operate with smaller loop gain 
margins and sound more natural, although it remains unclear 
to what extent low frequency stability may be controlled 
with any time-varying system. However, since the maximum 
loop gain increase due to time-variance reduces with the 
number of channels, the potential increase for any time- 
varying system is limited for systems with large numbers of 
channels. 



APPENDIX A: NORM EQUALITY 

Consider an /VX TV matrix H with entries which are com- 
plex random variables which are zero-mean, normally dis- 



tributed with identical variances cri in the real and imaginary 
parts. The total variance for each complex r.v. is a — 2cr R . 
The expected value of the Frobenius norm of H is 



£{||H|| 2 } = £ 



X 2 \n m 



n =■ 1 m— I 

N 



= E2 E{\H nm f} = N*<r\ 

n — 1 m — 1 



(Al) 



Consider now a product of M independent, identically 
distributed matrices 



(A2) 



The elements of H M each consist of a sum of N M ~ 1 terms, 
and each term is a product of M terms. The variance of a 
product of M zero mean terms has a variance which is the 
product of the individual variances, or <j 2M . The sum of 
N M ~ 1 of these terms has a variance of N M ~ ! cr 2M . Therefore 
the norm of H /W is 

^{llHj 2 }-^ 1 ^. (A3) 
The norm of a single matrix raised to the power of M is 

E{\\\\ 2 Y f =N 1M <j 2M . (A4) 
Hence 



1 



^{l|HJ| 2 }-^T^{|iHf}^ 



(A5) 



The norm approximation is obtained by removing the expec- 
tation 



IIHJ 2 



1 



M 



* >n = I 



|H„ : 



(A6) 



since taking the expected values of both sides produces Eq. 
(A5). 



APPENDIX B: RELATIONSHIP BETWEEN THE MEAN 
OF THE LOG AND LOG OF THE MEAN 
OF ^-SQUARED VARIABLES 

Let H be an NX /V matrix with elements which are zero- 
mean, normally distributed, with real and imaginary part 
variances cr)-cr 2 R . The squared Frobenius norm of H is 



N N 



(Bl) 



where H nmR is the real part of H nm and H nml the imaginary 
part. The norm thus consists of a sum of 2/V 2 normal random 
variables squared. The probability density of the sum is x 
squared with 2N 2 degrees of freedom 41 



< l(y) - 2 ^ r(N 2y 

The mean of y is 

E{y} = 2<r 2 R N 2 . 
The variance of y is 

varW = (2^) 2 A^ 2 . 



(B2) 



(B3) 



(B4) 



Consider now the mean and variance of z = ln(y). The mean 

.34 



of z can be calculated using the formula 
Jo 

where g(y) = \n(y). Substituting for g(y) and p(y), 



(B5) 



2 n ~<t^ T(N 2 ) 



Using the integral from Ref. 37 [4.352] 



(B6) 



r, 

Jo 



y"e t& y\ti{y)dy= -77+T 
0 M 



1 1 1 
! + -+ - + •••+ C 



-ln(/*) 



(B7) 



where C = 0.577215 is Euler's constant, this may be simpli- 
fied to 



E{\n(y)} = 



1 



+ ln(2cr 2 ) 



= ^(W 2 ) + ln(2<4), 



(B8) 



where U (.) is the psi function (Appendix C). For unit power 
gain transfer functions 2<r R = 1 and E{\r\(y)} = s lr(N 2 ). 

The variance of z = ln(y) requires the expected value of 
z 2 , which may be found by substituting (ln{») 2 in Eq. (B6) 
Using Ref. 37 [4.352] 



Jo 



, v- 1 „ ~ M-V 



r(v), 



e -*(ln(x)) 2 c/.v= ;r{[*(v)-ln(M)] 2 



(B9) 



+ «2,v)}, 

where f is the zeta function (Appendix C), yields 

E{(in(y)) 2 }^a2,N 2 )^(^(N 2 )) 2 , (BIO) 

Hence the variance for unit power gain is 

var{In(y )} = f (2,7V 2 ) . (B 1 1 a) 

The difference between \n(E(y)) and E(\n(y)) is 
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\n(E(y))- E(\n{y)) = 2 ln(AO -W(N 2 ) 



= 2 ln(/V)" 



= 1 k 



(Bllb) 



In dB, 101og(£{y}) = 20Iog(A0 for unit power gain , and so 
the differen ce between 10 log[||H'(co)|| 2 ] and 
10log[||H(a>)|| 2 ] is 



A(A0 = 201og(A0- 



10 



ln(10) 



^(7V 2 ) 



= 20log(N)- 



10 



ln(10) 



2 i-c 



(B12) 



APPENDIX C: PSI AND ZETA FUNCTIONS 

The psi function has the series definition 37 
1 1 



x + k k+\ 



(CI) 



where C= 0.577215 is Euler's constant. For integers x = n 
this becomes 

W^-c+S X- (C2) 

The Riemann Zeta function has the series definition 37 
1 



For integers z — n and </ = m this can be written 
f(/?,m)=2 2 py- 

Ar= 1 k k= I « 

For n = 2 the infinite sum is approximately 



A: =2 77= 1.644924 



and 



(C3) 



(C4) 



(C5) 



(C6) 



APPENDIX D: STABILITY OF A SINGLE CHANNEL 
SOUND SYSTEM WITH REVERBERATOR 

Consider a single channel sound system with a room 
transfer function between the power amplifier input and mi- 
crophone preamplifier output H{oj) and a reverberator with 
transfer function X(a>). We assume that both of these have 
real and imaginary parts with are zero-mean normal with the 
same variances a 2 R ~0.5, so that they have unit power gain. 
The preamplifier output is then scaled by jul to control stabil- 
ity. The loop gain transfer function (excluding fj£) is HX. For 
stability, the Nyquist theorem requires that the locus of HX 
does not encircle the point 1 +J0. For a statistical analysis 
we assume this is equivalent to saying that the real part of 
HX does not exceed 1 . 



The real part of the loop transfer function is 

Re{ HX) = H R X R - HjX } , (D 1 ) 

where the subscripts R and / denote real part and imaginary 
part, respectively. This term is the sum of two products of 
normals. It may be shown that the pdf of a product of normal 
random variables z — xy each with the same variance a' R has 
the form 41 



7TO-P 



and the characteristic function is 
1 



(D3) 



The sum of two products of normal random variables v 
— x 1 v 2 + -V2> ; 2 therefore has the characteristic function 

1 



and the resulting pdf is 
1 



p{v)= rr e 
2a R 



\v\/v 2 R 



The cumulative distribution is 

2 



C(v) = 



1 - 



i via a 



u<0 
u^0' 



(D4) 



(D5) 



(D6) 



Assuming that the transfer function is sampled with spacing 
A/ sufficient to produce uncorrected samples, across a 
bandwidth /?, the total number of uncorrected samples is 
B/Af, and the risk of instability with loop gain /a is 2 



Pr{inst}= 1 



B/Af 



Bl\f 



(D7) 
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